Selectively Acquiring Customer Information: A New Data Acquisition Problem and an Active Learning-Based Solution

نویسندگان

  • Zhiqiang Zheng
  • Balaji Padmanabhan
چکیده

This paper presents a new information acquisition problem motivated by business applications where customer data has to be acquired with a specific modeling objective in mind. In the last two decades there has been substantial work in two different fields optimal experimental design and machine learning – that has addressed the issue of acquiring data in a selective manner with a specific objective in mind. We show in this paper that the problem presented here is different from the classic model-based data acquisition problems considered thus far in the literature in both fields. Building on the work in optimal experimental design and in machine learning we develop a new active learning technique for the information acquisition problem presented in this paper. We demonstrate that the proposed method perform well based on results from applying this method across twenty Web usage and machine learning datasets.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Selective Data Acquisition for Machine Learning

In many applications, one must invest effort or money to acquire the data and other information required for machine learning and data mining. Careful selection of the information to acquire can substantially improve generalization performance per unit cost. The costly information scenario that has received the most research attention (see Chapter X) has come to be called ”active learning,” and...

متن کامل

On Active Learning for Data Acquisition

Many applications are characterized by having naturally incomplete data on customers – where data on only some fixed set of local variables is gathered. However, having a more complete picture can help build better models. The naïve solution to this problem – acquiring complete data for all customers – is often impractical due to the costs of doing so. A possible alternative is to acquire compl...

متن کامل

Active Feature Acquisition for Classifier Induction

Many induction problems, such as on-line customer profiling, include missing data that can be acquired at a cost, such as incomplete customer information that can be filled in by an intermediary. For building accurate predictive models, acquiring complete information for all instances is often prohibitively expensive or unnecessary. Randomly selecting instances for feature acquisition allows a ...

متن کامل

Solution of Backup Multifacility Location Problem by Considering the Ideal Radius for each Customer

In this paper we introduce a new facility location model, called backup multifacility location problem by considering the ideal radius for each customer. In this problem the location of clients are given in the plane. A radius is assigned to each client. We should find the location of new facilities, which some of them may fail with a given probability, such that the sum of weighted distances f...

متن کامل

Active Learning: An Approach for Reducing Theory-Practice Gap in Clinical Education

Introduction: The gap between theory and practice in clinical fields, including nursing, is one of the main problems that many solutions have been suggested to eliminate it. In this article, we have tried to investigate its solution through active learning. Methods: In this review article, searching articles published during 2000-2012 was done through library references, scientific databases. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Management Science

دوره 52  شماره 

صفحات  -

تاریخ انتشار 2006